A Single Author Style Representation for the Author Verification Task
نویسندگان
چکیده
This paper presents our experience implementing three approaches for the ‘PAN 2014 Author Identification’ [3,1] task using the same representation for the author’s style. Two of our approaches extend previous successful approaches: naive Bayes [4] and impostor [8] methods. The third approach is based on original research on sparse representation for text documents. We present results with the official development and test corpora.
منابع مشابه
Author Verification Using Syntactic N-grams: Notebook for PAN at CLEF 2015
This paper describes our approach to tackle the Author Verification task at PAN 2015. Our method builds a representation of an author’s style by using the information contained in dependency trees. This information is represented as syntactic n-grams and used to conform a vector space. Using unsupervised machine learning approach, each instance is associated to the correponding author using the...
متن کاملStyle-based Distance Features for Author Verification Notebook for PAN at CLEF 2013
In this paper we present the approach we took in our participation to the PAN 2013 Author Profiling task. It is an adaptation of our system submitted for author identification, assuming that a profile category (authors belonging to the same gender and age group categories) can be analyzed in the same way as an author’s style.
متن کاملThe Crisis of Representation in Azadeh Khanoom and Her Author by Reza Baraheni
The crisis of representation is a topic widely discussed in critique and theory of postmodern literature. This refers to the crises of the present era including the crisis of meaning, the perplexity of contemporary humankind amidst a mass of valid and invalid data, alienation, etc. Literature, as the epitome of human life, is a reflection of these crises in the contemporary era. Azadeh Khanoom ...
متن کاملRandom Forest with Increased Generalization: A Universal Background Approach for Authorship Verification
This article describes our approach for the Author Identification task introduced in PAN 2015. Given a set of documents written by the same author and a questioned document with an unknown author, the task is to decide whether the questioned document was written by the same author as the other documents or not. Our approach uses Random Forest and a feature-encoding scheme based on the Universal...
متن کاملTraining of Foreign Students in the Academic Russian Letter
For the foreign student it is very important to own skills of the academic letter. It is an important indicator of professional and research competence of the student. The mobility of foreign graduates of the Russian higher education institutions depends on their level of proficiency in the academic Russian letter as communications between the educational and scientific organizations extend at ...
متن کامل